# DPO Training
Summllama3.2 3B
Text summarization model initialized from Llama3.2-3B-Instruct, optimized through large-scale summarization feedback DPO training
Large Language Model
Transformers

S
DISLab
441
36
ECE TW3 JRGL V5
Apache-2.0
ECE-TW3-JRGL-V5 is a new model obtained by merging the MoMo-72B-lora-1.8.7-DPO and alpaca-dragon-72b-v1 models through mergekit, integrating the advantages of multiple models.
Large Language Model
Transformers

E
paloalma
10.59k
1
Featured Recommended AI Models